首页> 外文OA文献 >StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
【2h】

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

机译:stackGaN:使用stacked实现逼真的图像合成文本   生成性对抗网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Synthesizing high-quality images from text descriptions is a challengingproblem in computer vision and has many practical applications. Samplesgenerated by existing text-to-image approaches can roughly reflect the meaningof the given descriptions, but they fail to contain necessary details and vividobject parts. In this paper, we propose Stacked Generative Adversarial Networks(StackGAN) to generate 256x256 photo-realistic images conditioned on textdescriptions. We decompose the hard problem into more manageable sub-problemsthrough a sketch-refinement process. The Stage-I GAN sketches the primitiveshape and colors of the object based on the given text description, yieldingStage-I low-resolution images. The Stage-II GAN takes Stage-I results and textdescriptions as inputs, and generates high-resolution images withphoto-realistic details. It is able to rectify defects in Stage-I results andadd compelling details with the refinement process. To improve the diversity ofthe synthesized images and stabilize the training of the conditional-GAN, weintroduce a novel Conditioning Augmentation technique that encouragessmoothness in the latent conditioning manifold. Extensive experiments andcomparisons with state-of-the-arts on benchmark datasets demonstrate that theproposed method achieves significant improvements on generating photo-realisticimages conditioned on text descriptions.
机译:从文本描述中合成高质量图像是计算机视觉中一个具有挑战性的问题,并且具有许多实际应用。现有的文本到图像方法生成的样本可以大致反映给定描述的含义,但是它们无法包含必要的细节和生动的对象部分。在本文中,我们提出了堆栈式生成对抗网络(StackGAN)来生成以文本描述为条件的256x256逼真的图像。我们通过草图细化过程将难题分解为更易于管理的子问题。 Stage-I GAN根据给定的文本描述来绘制对象的原始形状和颜色,从而生成Stage-I低分辨率图像。 Stage-II GAN将Stage-I的结果和文字描述作为输入,并生成具有逼真的细节的高分辨率图像。它能够纠正第一阶段结果中的缺陷,并在优化过程中添加引人注目的细节。为了提高合成图像的多样性并稳定对条件GAN的训练,我们引入了一种新型的条件增强技术,该技术可增强潜在条件流形中的平滑度。在基准数据集上进行的大量实验和与最新技术的比较表明,所提出的方法在生成以文本描述为条件的逼真的图像方面取得了显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号